REMARKS

Applicants have amended claim 26 in an attempt to clarify the designation of the ERL 
referred to. What is specifically defined is the portion of the ERL at the N-terminus of the 
second module. It is that portion that is represented by SEQ. ID. Nos.: 12-19. The kind 
suggestion of the Examiner with respect to claims 28-39 has been adopted, thus obviating the 
rejection as to redundancy. No new matter has been added and entry of the amendment is 
respectfully requested. 

At the outset, applicants wish to express their appreciation to the Examiner, not only for 
the withdrawal of a goodly number of rejections previously made, but also for the highly 
professional and clear manner in which the status of the claims has been presented. Applicants 
are especially appreciative of the summary of the issues, and the clear accounting, one way or the 
other, of the rejections previously made. Applicants' response to the rejections maintained is set 
forth below. 

Formal Matters 

Enclosed herewith is Figure 3 as a formal drawing. The objection to the redundancy in 
claims 28-39 has been obviated by amendment. 

The Rejection Under 35 U.S.C. § 112, Paragraph 2 

Applicants appreciate the recognition that claim 25 adequately clarifies the RALs, which 
are the only subject of that claim. Therefore, it is believed that claim 25 is sufficiently clear. 
As amended, claim 26 is clear as well. While claim 26 may not define the entire ERL, the 
portions that are specified are clearly defined. 

With regard to the generic issue, applicants do not define a consensus sequence for either 
the ERLs or RALs because no such consensus exists, as the Examiner correctly points out. 
However, this does not mean that the skilled artisan would not be able to identify which 
sequences represent these moieties from art-known materials. By examining the consensus 
sequences of the various modules, either of the same PKS or of different PKS, it is apparent 
where the boundaries of the linkers must be. Enclosed is a copy of an article by Donadio et al., 
Gene (1992) 111:51-60. Figure 2 of this article sets forth in "stacked" form the sequences of 
modules 1-6 of the erythromycin PKS. With respect to the RALs, which exist between M1 and 
M2, M3 and M4, and M5 and M6, it will be seen that the ends of the consensus sequences which 
comprise these modules are clearly marked. The amino acid sequence appearing between these 
consensus regions is, in every case, precisely that set forth in Figure 3 as the appropriate 
intrapolypeptide linker. (An extra copy of Figure 3 is included as Exhibit B to facilitate 
comparison.) 
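
By way of illustration only, the boundary comparison described above can be sketched in code. The sketch assumes that the ends of the flanking consensus domains have already been annotated; the `Domain` record, the function name and the toy sequence are hypothetical and are not taken from Donadio or from Figure 3. The linker is simply read off as the residues lying between the two annotated domains.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Domain:
    name: str   # hypothetical label, e.g. "ACP1" or "KS2"
    start: int  # 1-based position of the first residue
    end: int    # 1-based position of the last residue (inclusive)


def linker_between(sequence: str, upstream: Domain, downstream: Domain) -> str:
    """Return the residues lying strictly between two annotated domains."""
    if downstream.start <= upstream.end:
        raise ValueError("domains overlap or are out of order")
    # Convert 1-based inclusive coordinates to a Python slice.
    return sequence[upstream.end : downstream.start - 1]


# Toy polypeptide (invented): 10 "ACP" residues, a 5-residue linker, 10 "KS" residues.
toy = "A" * 10 + "GSTPQ" + "K" * 10
acp = Domain("ACP1", 1, 10)
ks = Domain("KS2", 16, 25)
print(linker_between(toy, acp, ks))  # GSTPQ
```

The same slice applies whether the two domains are taken from the same PKS or from different PKS, provided the coordinates refer to a single polypeptide sequence.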

Figure 3 of the patent shows the N-terminal portions of the interpeptide linkers that are at 
the starts of module 3 and module 5. Unfortunately, the Donadio article does not show the 
complete upstream sequence from the KS domain at the start of these proteins, as indicated by the 
position numbers at the right of the figure: there is space for 100 amino acids in each line, yet the 
numbers at the ends of these lines are 126 and 122, respectively. However, it will be seen that 
the sequence immediately preceding the KS domain, in each case, matches that presented in the 
figure. The additional upstream sequence is shown in Exhibit C-1; the complete upstream 
sequence known for erythromycin matches that described herein. The nature of the sequence 
could, of course, be obtained by reference, for example, to PCT publication WO 93/13663, which 
would provide this information, or to the GenBank deposit referred to in Donadio. Similarly, 
Figure 3 does not provide information on the C-terminal portion of the interpolypeptide linkers, 
but this too could be obtained from the PCT publication and is shown in Exhibit C-2. Again, the 
figure in the Donadio paper is not quite complete: for example, DEBS-1, which would contain the 
C-terminal portion downstream of the ACP of module 2, contains 3491 amino acids, while the 
depiction in Donadio shows only up to position 3418. The remainder of this sequence can also 
be derived from the GenBank deposit referenced in the Donadio paper, for example. 
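
The position arithmetic in the preceding paragraph can be checked with a short worked sketch. It uses only the numbers quoted above; the function name is hypothetical, and the calculation assumes that the first row of the alignment contains no gap characters, so the upstream counts should be read as lower bounds.

```python
ROW_WIDTH = 100  # alignment columns per row, as stated above


def omitted_upstream(first_row_end: int, row_width: int = ROW_WIDTH) -> int:
    """Minimum number of residues that precede the first displayed row."""
    return first_row_end - row_width


print(omitted_upstream(126))  # 26
print(omitted_upstream(122))  # 22

# C-terminal residues of DEBS-1 that lie beyond the last position depicted.
debs1_length = 3491
last_position_shown = 3418
print(debs1_length - last_position_shown)  # 73
```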

Exhibit D shows additional RAL sequences for rapamycin and erythromycin; the 
N-terminus of the KS domain is readily determined in all of these PKS proteins as an acidic 
amino acid (E or D) followed by proline, followed by a hydrophobic amino acid (L, I or V). The 
linker sequences in the rapamycin stack are much more homologous than the erythromycin 
linkers. By comparison of the termini of the KS domain, however, between the erythromycin 
and rapamycin sequences, the linker sequences for rapamycin can be determined. With respect 
to the intramolecular sequences, M3rap is that designated in the Figure ACP02/KS3; that 
designated M4rap is that designated as ACP03/KS4rop; that designated Mlrap is that set forth 
as ACP06/KS7. (Exhibit D does not depict any ERL sequence.) 
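
The three-residue pattern described above for the N-terminus of the KS domain (an acidic residue, then proline, then a hydrophobic residue) lends itself to a simple pattern search. The following is a minimal sketch only; the function name and the toy sequence are invented for illustration, and because so short a pattern will also occur by chance elsewhere in a large protein, each hit is merely a candidate to be confirmed against the surrounding KS consensus.

```python
import re

# Acidic residue (E or D), then proline, then a hydrophobic residue (L, I or V).
KS_START = re.compile(r"[ED]P[LIV]")


def candidate_ks_starts(sequence: str) -> list[int]:
    """Return the 1-based position of every match to the KS start pattern."""
    return [m.start() + 1 for m in KS_START.finditer(sequence)]


toy = "MSGAAEPVAVVAMACR"  # invented sequence, for illustration only
print(candidate_ks_starts(toy))  # [6]
```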

With respect to the comments regarding GenBank Accession No. M63676, the 
approximate values of module 1 and module 2 should be taken as simply "approximate." As 
shown by the consensus sequence in the Donadio article, which depicts the sequence of M63676, 
the amino acid sequences presented in Figure 3 are correct. 

In view of the foregoing illustration, it is believed that this basis for rejection may be 
withdrawn. 

The Rejection Under 35 U.S.C. § 112, Paragraph 1 

All claims were rejected on this basis for an asserted lack of written description. The 
Examiner is, of course, correct that if a composition of matter is claimed, there must be an 
adequate description of the components that are required. Applicants respectfully submit, 
however, that in view of the showing set forth above with regard to the rejection under § 112, 
paragraph 2, this is indeed the case. By simply consulting the known sequences of any cloned 
PKS, the relevant structure of the linkers can be ascertained in precisely the same manner as can 
the naturally occurring catalytic domains of the modules themselves. 

The attention of the Office is drawn, for example, to the holding in Amgen Inc. v. 
Hoechst Marion Roussel, Inc., 57 USPQ2d 1449 (D. Mass. 2001), 65 USPQ2d 1385 (Fed. 
Cir. 2003). As set forth in that case, the written description requirement does not necessarily 
require spelling out sequences of amino acids or nucleotides, provided the structural features can 
be ascertained in a sufficient manner to permit one of ordinary skill in the art to construct them. 
This is clearly the case here since one has only to review published sequences of modular PKS to 
define the linkers that are present therein. The fact that these linkers happen to be disparate in 
structure from each other is an accident of nature and does not preclude the ordinary skilled 
artisan from ascertaining workable embodiments with ease. 

In view of the foregoing, it is believed that the written description requirement is met. 

CONCLUSION 

As the lack of consensus among intermolecular and intramolecular linkers is an accident 
of nature and not the fault of applicants, and as it would be well within ordinary skill to obtain 
workable linkers for construction of the claimed compositions, applicants believe that the 
pending claims, claims 23, 25-26 and 28-39, are in a position for allowance, and passage of these 
claims to issue is respectfully requested. 

In the unlikely event that the transmittal letter is separated from this document and the 
Patent Office determines that an extension and/or other relief is required, applicants petition for 
any required relief including extensions of time and authorize the Assistant Commissioner to 
charge the cost of such petitions and/or other fees due in connection with the filing of this 
document to Deposit Account No. 03-1952 referencing docket No. 300622004600. 



Respectfully submitted, 

Dated: May 5, 2003 

By: 
Kate H. Murashige 
Registration No. 29,959 

Morrison & Foerster LLP 
3811 Valley Centre Drive, 
Suite 500 
San Diego, California 92130-2332 
Telephone: (858) 720-5112 
Facsimile: (858) 720-5125 

EXHIBIT A - VERSION WITH MARKINGS TO SHOW CHANGES MADE 



In the Claims: 

26. (Twice amended) The hybrid modular PKS of claim 23 wherein the portion of 
the ERL at the N-terminus of the second module is selected from the group consisting of 
[M3 ery, M5 ery, M4 rif, Ml rif, M8 rif, M9 rif, M5 rap, and M11 rap inter-module linkers 
wherein the portions of said modules coupled to the N-terminus of the succeeding module are 
represented by] SEQ. ID. NOs: 12-19, respectively. 

28. (Twice amended) The hybrid modular [polyketide] PKS of claim 23 which 
contains ery modules 1 and 3 through 6 inclusive and tylosin module 2, and wherein said 
polyketide chain is transferred from ery module 1 to tyl module 2 and then to ery modules 3 
through 6 inclusive. 

29. (Twice amended) The hybrid modular [polyketide] PKS of claim 23 which 
contains ery modules 1 through 5 inclusive and narbomycin module 6, wherein said polyketide 
chain is transferred from ery modules 1 through 5 inclusive to nar module 6. 

30. (Twice amended) The hybrid modular [polyketide] PKS of claim 23 which 
contains modules 1 and 3 through 6 inclusive of ery and modules 2-3 of tylosin, spiramycin or 
niddamycin, wherein said polyketide chain is transferred from ery module 1 to modules 2-3 of 
tylosin, spiramycin or niddamycin and then to ery modules 3 through 6 inclusive. 

31. (Twice amended) The hybrid modular [polyketide] PKS of claim 23 which 
contains modules 1 through 3 inclusive of tylosin, spiramycin or niddamycin and modules 3 
through 6 inclusive of ery, and wherein said polyketide chain is transferred from modules 1 
through 3 inclusive of said tylosin, spiramycin or niddamycin to ery modules 3 through 6 
inclusive. 

32. (Twice amended) The hybrid modular [polyketide] PKS of claim 23 which 
contains a module of tylosin, spiramycin or niddamycin and modules 1-2 and 3 through 6 
inclusive of ery, wherein said polyketide chain is transferred from ery modules 1-2 to the tylosin, 
spiramycin or niddamycin module and then to ery modules 3 through 6 inclusive. 

Serial No. 09/500,747 
Docket No. 300622004600 

33. (Twice amended) The hybrid modular [polyketide] PKS of claim 23 which 
contains modules 1 and 3 through 6 inclusive of ery and module 5 of tylosin, spiramycin or 
niddamycin having the enoyl reductase catalytic activity inactivated, wherein said polyketide 
chain is transferred from ery module 1 to module 5 of tylosin, spiramycin or niddamycin and 
then to ery modules 3 through 6 inclusive. 

34. (Twice amended) The hybrid modular [polyketide] PKS of claim 23 which 
contains ery modules 1 through 4 inclusive and 6 and module 6 of spiramycin or niddamycin, 
wherein said polyketide chain is transferred from ery modules 1 through 4 inclusive to module 6 
of spiramycin or niddamycin and then to ery module 6. 

35. (Twice amended) The hybrid modular [polyketide] PKS of claim 23 which 
contains module 1 of FK-506 or 520 and modules 2 through 14 inclusive of rapamycin, wherein 
said polyketide chain is transferred from module 1 of FK-506 or 520 and then to modules 2 
through 14 inclusive of rapamycin. 

36. (Twice amended) The hybrid modular [polyketide] PKS of claim 23 which 
contains module 1 and 11 through 14 inclusive of rapamycin and modules 2 through 6 inclusive 
of FK-506 or 520 wherein said polyketide chain is transferred from module 1 of rapamycin to 
modules 2 through 6 inclusive of FK-506 or 520 and then to modules 11 through 14 inclusive of 
rapamycin. 

37. (Twice amended) The hybrid modular [polyketide] PKS of claim 23 which 
contains module 1 of rapamycin, modules 2 through 7 inclusive of FK-506 or 520 and 
modules 12 through 14 inclusive of rapamycin, wherein said polyketide chain is transferred from 
module 1 of rapamycin to modules 2 through 7 inclusive of FK-506 or 520 and then to 
modules 12 through 14 inclusive of rapamycin. 

38. (Twice amended) The hybrid modular [polyketide] PKS of claim 23 which 
contains module 1 of rapamycin, modules 2 through 8 inclusive of FK-506 or 520 and 
modules 13-14 of rapamycin, wherein said polyketide chain is transferred from module 1 of 
rapamycin to modules 2 through 8 inclusive of FK-506 or 520 and then to modules 13-14 of 
rapamycin. 

39. (Twice amended) The hybrid modular [polyketide] PKS of claim 23 which 
contains modules 1 through 10 inclusive of rapamycin and modules 7 through 10 inclusive of 
FK-506 or 520, wherein said polyketide chain is transferred from modules 1 through 10 inclusive 
of rapamycin to modules 7 through 10 inclusive of FK-506 or 520. 

© 1992 Elsevier Science Publishers B.V. All rights reserved. 0378-1119/92/$05.00 

SUMMARY 

Localization of the enzymatic domains in the three multifunctional polypeptides from Saccharopolyspora erythraea involved 
in the formation of the polyketide portion of the macrolide antibiotic erythromycin was determined by computer-assisted 
analysis. Comparison of the six synthase units (SU) from the eryA genes with each other and with mono- and multifunctional 
fatty acid and polyketide synthases established the extent of each β-ketoacyl acyl-carrier protein (ACP) synthase, 
acyltransferase, β-ketoreductase, ACP, and thioesterase domain. The extent of the enoyl reductase (ER) domain was established 
by detecting similarity to other sequences in the database. A segment containing the putative dehydratase (DH) domain 
in EryAII, with a potential active-site histidine residue, was also found. The finding of conservation of a portion of the 
DH-ER interdomain region in the other five SU, which lack these two functions, suggests a possible evolutionary path for 
the generation of the six SU. 



INTRODUCTION 

Erythromycin, a macrolide antibiotic produced by S. erythraea, is composed of the 
polyketide-derived 14-membered macrolactone ring, 6dEB, to which are attached two 
deoxysugars, cladinose and desosamine. Synthesis of 6dEB involves six elongation cycles 
that resemble the steps in fatty acid synthesis. It has recently been shown that 6dEB 
synthesis requires three adjacent eryA genes encoding large multifunctional polypeptides 
and that the eryA cluster consists of six modules (repeated motifs), each encoding a 
different SU specific for one of the elongation steps (Cortes et al., 1990; Donadio et al., 
1991). We proposed that the genetic organization of eryA and the steps in the biochemical 
pathway of 6dEB are colinear (Donadio et al., 1991). A scheme of the enzymatic activities 
leading to 6dEB is shown in Fig. 1. 

In previous work, the FAS-like activities ACP, AT, KR and KS in the multifunctional 
eryA-encoded polypeptides were identified from the presence of 'signature sequences' 
found at the active site of the functional domains (Cortes et al., 1990; Donadio et al., 
1991). However, signature sequences have not been assigned previously to the DH and 
ER functions, both of which have been poorly characterized biochemically. Multifunctional 
FAS systems are organized in discrete functional domains which can be resolved upon 
limited proteolysis (Wakil, 1989). The eryA-determined SU can be assumed to consist of 
linear domains on the basis of their similar organization to the animal FAS systems 
(Cortes et al., 1990; Donadio et al., 1991). Here, by comparing the 6dEB PKS domains 
with each other and with other multi- and monofunctional FAS and PKS systems, we 
subdivide each eryA-encoded polypeptide into its constituent domains and propose a 
location for the DH and ER domains in EryAII. Conclusions similar to those reported here 
have been independently reached in the laboratory of P.F. Leadlay (personal 
communication) for the domain organization of EryAII and EryAIII and by Witkowski 
et al. (1991a) for the domain organization of rat FAS. 

Fig. 1. Scheme of 6-deoxyerythronolide B synthesis (Donadio et al., 1991). The top portion shows the three eryA-encoded polypeptides containing SU 
1 through 6. Enzymatic activities belonging to the first and to the second SU for each polypeptide are represented by empty and shaded circles, 
respectively. The bottom portion illustrates the role of each SU in the synthesis of 6dEB, where the C2 units in the ring introduced by odd- and 
even-numbered SU are represented by dashed and continuous lines, respectively. 

RESULTS AND DISCUSSION 

(a) Extent of KS, AT and ACP domains 

The six SUs are organized in pairs in three large deduced aa sequences (Fig. 1). Within 
each polypeptide, the end of the N-terminal SU was arbitrarily placed where the sequence 
C-terminal to the first ACP domain began to diverge from that of the second ACP, located 
toward the C-terminal end of the polypeptide. In this way, SU 1 through 6 were determined 
to be 1975, 1516, 1484, 1973, 1480 and 1690 aa in length, respectively. The domain order 
of each SU is KS, AT, KR and ACP. SU1 has additional AT and ACP domains at its 
N terminus, SU4 contains DH and ER domains between AT and KR, and SU6 contains 
a TE at its C-terminal end (Fig. 1). We have used the different compositions of the SU as 
a first approximation in establishing the extent of some of the domains and confirmed the 
results obtained by comparison with the monofunctional PKS proteins from the gra 
(Sherman et al., 1989) and tcm (Bibb et al., 1989) clusters. In this way, the beginnings and 
ends of the KS domains could be easily assigned to the highly conserved aa motifs 
d(e)PiAiVgmaCR (uppercase letters refer to invariant residues) and GTNAHvIeE, 
respectively (Fig. 2). Comparison of the sequence of the first AT of module 1 (AT-S) to the 
previously aligned six other ATs clearly indicated the aa motif vfVFPGQGaQW as the 
likely beginning of the AT domains (Fig. 2). In a similar way, their end could be placed 
where AT-S is seen to diverge from the other ATs, a few residues after the highly 
conserved aa motif GVavdwxxa (Fig. 2). Comparison of the eryA ATs with the only 
monofunctional sequence available to us, a transacylase from Streptomyces glaucescens 
(R. Summers and C.R. Hutchinson, personal communication), confirmed the assignment 
of the N-terminal end and showed significant matches, albeit with two long gaps, up to 
the pvxLPt motif, just beyond the C-terminal end of the domains established by comparison 
of the eryA ATs. 
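
As a quick arithmetic check on the SU lengths quoted above, the lengths can be summed per polypeptide. This is a sketch only: it assumes that the two SU of each polypeptide partition it completely, which is how the SU boundary is described above, and it takes the pairing (SU 1 and 2 in EryAI, 3 and 4 in EryAII, 5 and 6 in EryAIII) from the organization of the SU in pairs. The variable names are illustrative.

```python
# SU lengths in aa, as quoted in the text.
su_lengths = {1: 1975, 2: 1516, 3: 1484, 4: 1973, 5: 1480, 6: 1690}

# Assumed pairing of SU within the three polypeptides.
polypeptides = {"EryAI": (1, 2), "EryAII": (3, 4), "EryAIII": (5, 6)}

totals = {
    name: sum(su_lengths[su] for su in sus)
    for name, sus in polypeptides.items()
}
print(totals)  # {'EryAI': 3491, 'EryAII': 3457, 'EryAIII': 3170}
```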

When the segment encompassing the ACP active-site aa motif LGxDS from the first ACP 
of SU1 (ACP-S) was aligned with the six other eryA ACPs, the ends of the ACP domains 
appeared to coincide with the aa motif lAxhlxa, situated 38 aa after the active-site Ser 
(Fig. 2). Their starts could not be easily established by this criterion, however, since 
conservation between ACP-S and the other six ACPs began a few aa before the active-site 
motif. Comparison with monofunctional ACPs from gra and tcm, however, indicated that 
significant matches began with the motif lAglsxxe and ended with the motif Glrlpxtlv, 
15 aa before the end established by comparison of the eryA ACPs alone (Fig. 2). This 
would place the N-terminal end in all of the ACP domains, except ACP-S, 45 aa upstream 
from the active-site Ser. 
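
The offsets given above (a start 45 aa upstream of the active-site Ser and an end about 38 aa after it) can be turned into an approximate domain window around each LGxDS match. The sketch below is illustrative only: it reads the x in LGxDS as any residue, the function name and toy sequence are hypothetical, and the window would not apply to ACP-S, which the text excepts.

```python
import re

ACP_SITE = re.compile(r"LG.DS")  # LGxDS, with x read as any residue
UPSTREAM = 45    # residues before the active-site Ser, per the text
DOWNSTREAM = 38  # residues after the active-site Ser, per the text


def acp_windows(sequence: str) -> list[tuple[int, int, int]]:
    """Return (start, ser, end) as 1-based positions for each LGxDS match.

    Start and end are clipped to the bounds of the sequence.
    """
    windows = []
    for m in ACP_SITE.finditer(sequence):
        ser = m.start() + 5  # 1-based position of the S in LGxDS
        start = max(1, ser - UPSTREAM)
        end = min(len(sequence), ser + DOWNSTREAM)
        windows.append((start, ser, end))
    return windows


toy = "A" * 60 + "LGADS" + "A" * 60  # invented sequence
print(acp_windows(toy))  # [(20, 65, 103)]
```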

(b) Conservation of KS, AT and ACP in multifunctional systems 

The aligned KS, AT and ACP domains from the six eryA SUs were compared with those 
found in other multifunctional systems, FAS from chicken (Yuan et al., 1988; Holzer 
et al., 1989) and rat (Amy et al., 1989), and the PKS 6MSAS from Penicillium patulum 
(Beck et al., 1990). As expected, a higher overall degree of similarity was observed in the 
intra- than in the interdomain regions among the multifunctional systems analyzed 
(Fig. 2). The nine KS domains examined shared 71 invariant aa residues (out of 425 aa) 
and, considering conservative substitutions, were similar for 205/425 residues (Fig. 2). 
In contrast, AT and ACP domains exhibit lower conservation (Fig. 2; see also below). It 
is tempting to speculate that the basis for the apparently high sequence constraint in the 
KSs from distantly related organisms is that, in addition to catalyzing the condensation 
of the acyl chain with the extender unit charged on the ACP to form β-ketoacyl-ACP 
(Wakil, 1989), the KS is also responsible for the transacylation of the elongated acyl 
chain from the ACP to its own active-site Cys residue. 
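
For orientation, the KS conservation counts quoted above can be expressed as fractions, and the notion of an invariant residue can be made concrete with a toy alignment. This is a minimal sketch: the helper names and the three-row alignment are invented for illustration and are not the alignment of Fig. 2.

```python
def fraction(count: int, length: int) -> float:
    """Share of alignment positions falling in a given class."""
    return count / length


def invariant_columns(rows: list[str]) -> int:
    """Count columns in which every row carries the same residue (gaps excluded)."""
    return sum(
        1
        for col in zip(*rows)
        if len(set(col)) == 1 and col[0] not in "-."
    )


# KS domains: 71 invariant and 205 similar positions out of 425.
print(f"{fraction(71, 425):.1%}")   # 16.7%
print(f"{fraction(205, 425):.1%}")  # 48.2%

toy_alignment = ["GTNAH", "GTNAH", "GSNAH"]  # invented rows
print(invariant_columns(toy_alignment))  # 4
```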

Invariant aa accounted for 26/345 residues in the ten AT domains examined, and approx. 
30% of the ten sequences involved conservative substitutions (Fig. 2). It is noteworthy 
that, in addition to the segment around the signature aa sequence GHSxG, two additional 
segments, each containing an invariant His, are highly conserved in the ten ATs. Serine 
proteases are known to contain active-site Ser and His residues, distant in the primary 
structure, but brought into close proximity in the folded protein (Hess, 1971). Since the 
types of reaction carried out by ATs and serine proteases are believed to be similar 
(McCarthy and Hardie, 1984), the finding of invariant His in the ATs distant from the 
active-site Ser suggests a similarity with serine proteases also in catalytic mechanism. 

The ten ACP domains examined exhibit only one invariant residue outside of the LGxDS 
motif. The 30-aa N-terminal segment of the ACP domains from the SU shows some 
conservation with mono- and multifunctional proteins, except for ACP-S (data not shown). 
In their independent study of domain organization of rat FAS, Witkowski et al. (1991a) 
have placed the N-terminal end of the ACP domain approx. 10 aa C-terminal to the end 
suggested here. Thus, the ACP-S domain lacks in its N-terminal portion a segment of at 
least 15 aa when compared to the other ACPs. This apparent anomaly may reflect a 
functional difference between ACP-S and the other ACPs. According to the model 
proposed for 6dEB synthesis (Donadio et al., 1991), the sole role of ACP-S consists of 
receiving the propionyl starter unit from AT-S and of transferring it to the KS of SU 1. 
Its function would thus be limited to acylation/deacylation, and this ACP would not be 
employed in carrying the β-ketoacyl chain through the appropriate processing steps, as do 
all other FAS and PKS ACPs known. 

(c) Extent of ER, DH, KR and TE domains 

Only eryA module 4 encodes DH and ER functions, which, to date, have only been 
tentatively located in FAS systems (Tsukamoto and Wakil, 1988). A 400-aa segment 
unique to SU4, and believed to include the ER domain (Donadio et al., 1991), was used 
to search the databases. Surprisingly, the best matching sequences found, aside from the 
rat and chicken FASs, were from structural proteins of higher eukaryotes, ζ-crystallin 
from guinea pig lens (Rodokanaki et al., 1989) and the membrane protein VAT-1 from 
Torpedo californica synaptic vesicles (Linial et al., 1989). The similarity of these two 
proteins to alcohol dehydrogenases has already been reported. In addition, the 5' end of 
an URF divergently transcribed from the dnaB gene of Salmonella typhimurium (Wong 
et al., 1988) was detected by this search. Alignment of these sequences indicated that the 
ER domain is likely to extend for approx. 330 aa and contains 19 invariant and 90 
conserved aa residues (Fig. 3). In particular, the sequence LxHxg(a)xGGVG, proposed as 
the NADPH-binding site for the rat ER (Amy et al., 1989; Witkowski et al., 1991a), 
appears to be highly conserved in the six sequences examined. It should be noted that 
database searches also indicated similarity between the ER domain and alcohol 
dehydrogenases, but this similarity is limited mainly to the putative NADPH binding site 
(data not shown). Although no enzymatic role has been assigned to ζ-crystallin or VAT-1, 
the high similarity detected among the six sequences suggests a possible present or former 
role for the two monofunctional proteins in reducing double bonds that lie α,β to a 
carbonyl group (Piatigorsky and Wistow, 1991). 

The ca. 500-aa segment defined by the end of the AT and 
the beginning of the ER domain in SU4 showed some 
similarity to the corresponding segments from rat and 
chicken FAS (Donadio etaL, 1991), as well as with an 
approx. 200-aa stretch located between the AT and KR 
domains of 6msas (data not shown), which is believed to 
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- €00 
-2071 

- 126 
-1584 

- 122 
-1581 

- 700 
-2171 

- 225 
-1684 

- 222 
-1680 

- 800 
-2271 

- 325 
-1784 

- 322 
-1780 

- 898 
-2371 

- 425 
-1882 

- 422 
-1880 




!q-~|AG§| EVFTE - 5RQG * -S P g§ - > P 
GVFVD - 5RQR J-Aftf!Hs-|A 



GMLVtfjSRMN ^f^jc^A ISACStfipCli 

•GAG 
SAP 
SKA-ttf? 
SAE- 



YTFVC 
GAFTE 
EVFTE 



3TQR 
&SQG C3— J 
SRQG C 



C-AS&--C- 



•Afig|-C-P 
-A«|§-C-A 




v-cp 

i-ci 



oQAsfHi 




uIaMsg 



KAIA- 
■I B£C- 

iajs- 

IB*A- 
WAJ A- 



t:p 



ADptoHpI 
dGA-URVlf 

iEPs-\fcWrrr 
veaivH 

igaH 



J £ — LAT 



— LOT 

- IAT 

- LST 



LAT 



RDRECp 
KS-GSSC 
RD-DADF i 
QD-DR..J 
AE-DPDI i 
■K3-GSSC ■ 




3TMqaa/0v 

™* — r 



— ?vr - 



*£fl| h AfA HtWH* 

§1 * 

Hp — '? 



■VA 



VA - 



If V\i 



A |G^lfpP 

tVA 



Ir/ll 



^lfflHA 
PK- 
M 

R! 

■felt 




3 



1 £ LI DEP 
-tULIDGV 
£-C\ALIAGN 
-p VE VIREE 
\SWSQA 
3}-k V E L? BAV 




EAPQW EGERVEAGD Wfl^/LSAb 

-P-EPEPVPC} PRR..MLPAT GW-VVL-AH 

-A- ERE HRETTAHDG RPV-IW-A» 

-A- A EQEAARTER GPI-FVL-Ga 

-A-EAD EPEPAPDSG PV . - 1 VL-G I 

-P-EPEPLPE PGPVGVLAAA NSV-VLL-AR 




RLAAHLREHP GQDPRDIAYS 
RLADHLAAHP GIAPADVSWT 
QIAELL.ERP DADLAGVGLG 
ALAEHLROTP ELGLTDAAWT 
RLADHLAREP RNSLRDTGFT 
R LLES..AVDD SVPLTALASA 



■CPA 



--G 




AjAjLRVLDGlj? 
Eft/HR-] 
EA7RG-RE* 
CV^E-DAI 
C R LAG-RAV » 
C L &GQ-RAV i 



A ^V-r^r|- 

a flo-f naj* 



PEQSRPUE^V 
ALSTE- ■ 
AP3LE- - 
APGLD-I- 
PDPHE- 
ARPLD- 
APSLE- 




GESRMEjAAA^ 
DTSP' 



SV.P- 



AAU RE-ADMEPH I DI EVIE FI R 
ESI AE-DAMIjSEV ^SVSE|vi|. 



DACAIUFEPV IDVTLACVI 



RA-DEEK APM CDVKVSIVI, 



PV.P 

SSSP*f]-GF|l 

ESSE- \ -ES H SR-AE^ljSPH TbWKLLEfW 

RESQ-I-DSI RD-ERf I APH VDVSLTTLI 



ESQ AE-DA\(lJSEV AgEjSASE]VI 




SAl{H£lREjSlP 
V AI R£ Rll AT 
V APRS RIVRA 
VVCRS RIMS 
UAIRSQIIRE 



AVRSRV 
UAIEUKI 



I VGK G 



IAR 
IRA 



iagf:c-i> 



^PG^KGM!I 

V)AGP:S^t£V 

IDDCCG^SV 
IGGCCC^ASE 



ALSADEIEPR 
A...APAGEV 
RGGRSDVEKL 
ALGEAAVRER 
GASRDELETV 
GLGTEQAABR 
AAPGERARAL 



IA..RNDDEU 
RA..RIGDRU 
LADDSHTGRI 
LR..PMQDFI 
LA..RHDGRU 
IG..FFAGAC 
IA..PHEDRI 



- 204 
-1184 
-2659 

- 710 
-2164 

- 701 
-2166 



viASVtrc 



PF$ VIA 

wk 



Elrft- 

EVffA- 

SVrA-hCi-FjS 
A\ta4-' 
SI-S' 

SM?^-t£ 



C -PS 

■Ps wk 
■ss wk 



fePEPjvl ARRVQELSAE Gf^RAQV 
i)SDEl DRLVASCTTB 
i)AQf A REFLEYCEGV 
■fePGAllJ RAFSEDCAAE 
DEFFAEAEAR 
DELIAECEAE 
AELVARCEDE 



hPTAI 1] 
feSGE 



C I RAKF I \- 
ClRARAip 
CIRVPX 
ENKPRF ijA- 
C I TARF I ? 



C IRAKI I ? 




PF 



WFHPGGSEVP Fpjft^TCjGjAV 
EC E HPLPGFV 
VtRDEll^QAjljA .CITPRRAE7 
VtREE(l{L.E7ltr GI I APRPARV 
.1ITAVRGSV 
C 1 SPVSADV 
CISARRAAI 



I AEC f. RS A I A 
IRDAIHAEIG 



I EDF I AAE 1 G 



.TELA 



PEE 
TFH- 
PIU- 
AIV 




. DTRELVAC i 
TQPDELDAC • 
LDGTELDAC - 
MDGTELDAP * 
ZDTSAHDAS ■ 
IDTATMDTA- 
RDGADMGPF - 



KRF S£ RLI V R - 



-YF KI -Rl 



-Yt HI 



-YE HI -El r 
-YFfH-RE A, 
-YAjHI -EC *R 
t-SC^R 



303 
R -128fl 
YFfM2|-HEf:E -2758 
R - 809 
2263 
800 
2265 



E5EA] RSftLE VGPGIF\EA£ 



-iq-\(EUWE gHyrt)-] 



\GAI 



ro c- 



YA3 

-bri-\{lPfl^E £{- YM -]Jrf/- 
-3S-MRCIVE C-PD1-\|^ 
-UE-1RCI&E A-FW-vlv- 
- 3E -V SA ft VA C-HA1-\^«- 



P^VllAAAfb 

fr -ii tame: 
e—vi Assjva 

Er-W VQAVS 
E I -VI LHmV Z 
Pf-VITVCllE 

ph-vi| taaMq 




F5QGGMRREL 
-3DGSLAIEG 
OAGDADPEL 
- QGGDLSAEti 
-SQSGPHEEL 
- 3RGGLAC E H 
-3TAE.EHI 



LAAAQAFTC G 

EALSR-FW | 

talad-hte| 
rsmat-hvs | 
rnllr-hvi | 
talge-yac I 

AELAR-KV) i 



VTTAAYD 
IWESVHL 
WEAVL. 
-{[jljRWDVAL. 
LRPAV. 

SPAF. 
IWPJJVF. 



LEPKPVARRS TE VDEVSALRY RI EWRPTGA GEP ARLDGTWLVA 

L. . . . LPDRT TP R-ELDGWF- RV D-TEVPR SEP AALRGRW-W 

LOPAAPAAA. S-ELA. . RV S-TPIEK PES GNLDGDM-W 

PRPHRPADVS AL^— 152 88" ► Y-GLAEQG-E YGPSFQALRA A-RKDDSVYA EVSIAADEEG YAFHPVL-DA 

L...PIPTGG RA R-EDDDWR- QV V-REAEW ES ASIAGRV-LV 

L APE. . .V S-QLADSR- RV D-RPL.A TTP VDLEGGF-V . 

□ O E=3 a Q Q □ C=3 



DVGPHPAL 

GTGARRVPLP 

.GRAGXVD— 

.PGAAPFA— 

.AGGRPAE— 

. ADARP VE— 

.PAAPPVA— 

■ m m ■ 

KYAGTADBTS 

VPEGHEEDGW 

T.PLISPE.W 

VAQTLSLGAL 

TGPGVPSB.L 

. .HGSAPESL 



TT^EfeREFf^H - 
G-— (OGR- 
T- - 



EHQ-E- 
V4~ QFO- Y - 

TAA.RE 

TVEVRS. . . • 
TEMLCE. . . . 
GEPGGGKLPF 
SDAIRS.... 
TS 



388 
-1381 
-2853 

- 906 
-2358 

- 898 
-2355 

-1439 
-2908 

- 957 
-2592 

- 952 
-2399 
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1 ALESA GARVRELWD ARCGRDELAE RLRSV.GE.V AGVLSLLAVD EAEPEEAPLA LASLADTLSL VQAMVSA ELGCPLW. . . 

2 ALAEA -A E PEVTRG.VGG LVGDCA G-V-LLALE GO.... . .GAVQTLV- VRELDAE GIDAPLW. . . 

3 AIHAN -GRALRCEVD TSASRTEMAQ AVAQAGT.GF RG-L-LLSSD ESACR..PGV PAGAVGLLT- VQALGDA GVDAPVW. . . 

4 AWNTVTLHAS -ATSVR.WA TPAGADAMAL RVTDPAGHLV AT-D-LWRS TGEKWEQPEP RGGEGELHA- DWGRLAEPGS TGRWAADAS DLDAVLRSGE 

5 GLEQS -ATVLTCDVE . . . SRSTIGT ALEAADTDA1 ST-V-LLSRD GEAVD..PSL DA LA- VQALGAA GVEAPLVJ... 

[Fig. 2, alignment panels for the six eryA SU (rows 1-6 and S; row-end residue numbers at right): the sequence rows are not recoverable from the scan.]

Fig. 2. Alignments of the six eryA SU. The symbols on the left margin refer to the particular SU, with S referring to AT-S or ACP-S, as appropriate.
Numbers on the right refer to the aa sequence position at the end of each row in EryAI (for 1, 2 and S on left), in EryAII (for 3 and 4) and in
EryAIII (for 5 and 6). Sequences for EryAI and for EryAII and -III are from GenBank, accession Nos. M63676 and M63677, respectively. Invariant
aa residues in the six SU are marked by dashes. Dots refer to computer-introduced gaps to maximize alignments. Shaded boxes refer to aa residues
invariant in the six (or se[...]) [...] (Beck et al., 1990). Open boxes refer to conservative substitutions or invariant residues in all but one
sequence. The N terminus of chicken FAS is as[...] sequence (Holzer et al., 1989), as recently reported (Witkowski et al., 1991a). The KR of SU3, [...]
the other eight sequences, is ignored for boxing purposes. The extent of each domain is indicated by underlining of the sequences with solid black bars,
short, heavy dashes, long, heavy dashes, and open bars, representing the KS, AT, KR and ACP domains, respectively. The
segments of 152 and 315 [...] between the two arrows indicate invariant and conservative substitutions among the six SU. Computer-assisted
alignments were performed using PILEUP ([...] University of Wisconsin [...]), with a gap weight of 3.0 and a gap length weight of 0.1. The sequences of the six SU [...]
Subsequently, the segments corresponding to the first AT and ACP of SU1 (AT-S and ACP-S, respectively) were individually aligned with the other six
AT and ACP domains, respectively. Finally, the region of SU4 between the DH and ER domains (section c) [...]
between the AT and KR domains were separately aligned. The three alignments generated in this way were manu[...]
[...] eryA SU with the other multifunctional systems examined, chicken FAS, rat FAS and 6MSAS, PILEUP [...]
parameters. These segments have thus been ignored for boxing purposes.
[Fig. 3, alignment panels (rows: chicken, rat, SU4, ζ-cryst, VAT-1, Sty URF): the sequence rows are not recoverable from the scan.]

Fig. 3. Putative enoyl reductase domains. The segment from EryAII comprised between aa 2832 and 3138 was employed to search the GenBank and
EMBL (48285 sequences) and the Swissprot (20722 sequences) databases using the programs TFASTA and FASTA, respectively. Sequences showing
significant matches were aligned using PILEUP. Invariant aa residues are represented by dashes; dots refer to inserted gaps [...]
all the sequences are boxed. An asterisk indicates the stop codon in the corresponding gene. Sequences as indicated: chicken, FAS from chicken, aa
[...]; rat, FAS from rat, [...]
et al., 1989); VAT-1, membrane protein from Torpedo californica cholinergic synaptic vesicles, complete sequence (Linial et al., 1989); Sty URF,
Salmonella typhimurium unidentified open reading frame divergent from dnaB, translated from GenBank J03390 (Wong et al., 1988). Note that only the 5' end
of this sequence is available, with the resulting polypeptide ending as KALGAKL.



contain a DH function (Beck et al., 1990). The alignment
of these four sequences (Fig. 4) indicates that significant
homology is limited to an approx. 150-aa segment. Within
it, the invariant HxxxGxxxxP motif is embedded in a 25-aa
segment with a high degree of conservative substitutions,
involving mostly hydrophobic residues. The finding
of an invariant His residue in the most conserved region
among the four sequences is consistent with the proposed
role for a His as the active-site residue in the E. coli
β-hydroxydecanoyl thioester dehydrase (Bloch, 1971). The
corresponding gene encodes the active-site His in the
sequence HFIGDPVMP78 (Cronan et al., 1988), where
insertion of a gap between I and G would conform this
sequence to the proposed consensus. The HxxxGxxxxP motif
is also found at a single position in the S. cerevisiae FAS1
sequence as HLSHGVKMIP1057, approximately where the
DH domain has been tentatively placed (Schweizer et al.,
1986; Chirala et al., 1987). These observations suggest that
the DH domain in multifunctional FAS and PKS systems
is relatively short, extending for approx. 140-170 aa,
consistent with the 170-aa size of the E. coli enzyme, and point
to a specific His as one of the active-site residues involved
in catalysis. The same His has been independently
proposed by P.F. Leadlay (personal communication) as the
catalytic residue in the DH domain of EryAII. It should
also be noted that the two animal FASs and SU4 also
share the motif GYxYGPxFQ, approx. 110 aa after the
proposed active-site His, whereas 6msas does not. The
Fig. 4. Putative dehydratase domains. The approx. 500-aa segments from EryAII, chicken FAS, rat FAS and 6msas suspected to contain the DH
domain (see section c) were aligned using PILEUP. Only the portion showing significant matches is represented. For abbreviations and symbols, refer to
Fig. 3. The three blackened squares denote the putative active-site motif HxxxGxxxxP. The shaded bar denotes the highly conserved region common only
to the two FAS sequences and SU4. Sequences: chicken, FAS from chicken, aa 812-987; rat, FAS from rat, aa 837-1009; SU4, S. erythraea EryAII,
aa 2365-2551; 6msas, Penicillium patulum 6MSAS, aa 914-1096.
reason for this difference and a possible role for this motif
are at present unknown. No significant matches were
detected by database searching with the proposed DH
domain or by comparison with known aa dehydratases.
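
The motif argument above can be checked mechanically. The following is a minimal sketch, assuming the HxxxGxxxxP consensus is read as a plain pattern in which each x is any single position (including a gap character); the function name and the gapped E. coli string are illustrative and not part of the original analysis.

```python
import re

# H, any three positions, G, any four positions, P.
# The '.' wildcard also matches '-', so an alignment gap counts as a position.
DH_MOTIF = re.compile(r"H.{3}G.{4}P")

def find_motif(seq, pattern=DH_MOTIF):
    """Return (0-based start, matched text) for every occurrence, overlaps included."""
    hits = []
    pos = 0
    while True:
        m = pattern.search(seq, pos)
        if m is None:
            return hits
        hits.append((m.start(), m.group()))
        pos = m.start() + 1

yeast = "HLSHGVKMIP"          # S. cerevisiae segment quoted in the text
ecoli = "HFIGDPVMP"           # E. coli segment quoted in the text
ecoli_gapped = "HFI-GDPVMP"   # same segment with one gap between I and G (illustrative)

print(find_motif(yeast))         # matches at position 0
print(find_motif(ecoli))         # no match without the gap
print(find_motif(ecoli_gapped))  # matches once the gap is inserted
```

The ungapped E. coli segment fails only because G falls one position early, which is why a single inserted gap is enough to bring it into register with the consensus.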

All six eryA SU contain a segment corresponding to a 
KR domain, although the KR domain of SU3 is believed 
to be non-functional (Donadio et al., 1991). The beginning 
of the KR domains is likely to coincide with the region 
following the ER domain where SU4 realigns with the other 
five SU. This location was matched in the N-terminal por- 
tion of the monofunctional KRs involved in actinorhodin 
(Hallam et al., 1988) and granaticin (Sherman et al., 1989) 
synthesis. Thus, the ery KR domains are likely to start with 
the PxGTvLv motif, just upstream from the putative 
NADPH-binding site (Fig. 2). Since in all multifunctional 
FAS and PKS systems the KR is always followed by an 
ACP, the end of the KR domains was placed approx. 
190 aa after the NADPH-binding site, where conservation 
among the nine sequences examined began to decline 
(Fig. 2). This interpretation results in the separation of the 
KR and ACP domains of 90-100 aa in the PKS systems 
and of 60 aa in the FAS systems. 

The C-terminal end of SU6 contains a TE domain. This
domain was compared with the corresponding domains
from the two FAS sequences, with monofunctional
thioesterases from rat (Randhawa and Smith, 1987;
Safford et al., 1987) and duck (Poulose et al., 1985), and with
the TE-like ORF downstream from eryF (Weber et al.,
1991). The alignments (Fig. 5) indicate that the TE domain
in eryA extends for approx. 230 aa, and includes, in
addition to the invariant GxSxG motif common to ATs and
serine proteases, the GdH motif found near the C-terminal
end, which has been shown by site-directed mutagenesis to
be essential for activity (Witkowski et al., 1991b). Overall,
little similarity was detected among the six TEs analyzed,
which may be related to the three different classes of
substrates recognized by these enzymes (Wakil, 1989; Cortes
et al., 1990; Donadio et al., 1991). This is exemplified by
the low similarity between the TE domain of rat FAS and
the short chain TE from the same organism, and between
the TE of SU6 and the other TE-like sequence from the ery
cluster, although the role of the latter has not yet been
determined.

(d) Inter- and extradomain regions 

The overall domain organization of the three eryA-encoded
polypeptides is summarized in Fig. 6. It can be
seen that the largest interdomain regions are the five
segments between AT and KR, and the one between DH and
ER in EryAII. When the AT-KR interdomain regions from
SU 1, 2, 3, 5 and 6 were compared with the region from
SU4 containing the DH and ER domains, some similarity
could be detected under relatively stringent conditions (data
not shown). Computer-generated alignment of these six
segments indicated that these regions can be best
accommodated after accounting for two insertions in SU4, the
first of 152 aa, and the second of 315 aa (Fig. 2). These two
insertions correspond very closely to the DH and ER
domains, respectively, as determined above. In the 200-aa
segment which joins the DH and ER domains in SU4, a
Fig. 5. Thioesterase domains. The six TEs were compared using PILEUP. See legend to Fig. 3 for symbols. Sequences: rat, FAS from rat, aa 2209-
2505; chicken, FAS from chicken, aa 2193-2497; rat SC, short-chain TE from rat, complete sequence (Randhawa and Smith, 1987; Safford et al., 1987);
duck SC, short-chain TE from duck, complete sequence (Poulose et al., 1985); eryORF, downstream from S. erythraea eryF (Weber et al., 1991; GenBank
accession No. M54983); SU6, S. erythraea EryAIII, aa 2927-3170.
Fig. 6. Domain organization in 6dEB synthase. Each domain is
represented by a rectangle of different filling as shown, whose length is
proportional to the length of the domain. Note the partial filling of the first
KR of EryAII, which denotes an inactive KR (Donadio et al., 1991).



stretch of approx. 80 aa appears to be fairly well conserved
among the six SU (Fig. 2). The enzyme from S. erythraea
AKR5, which has been deleted of this 80-aa segment along
with the KR domain C-terminal to it in SU5, retains
active SU5 and SU6, as judged by the ability to produce a
significant amount of the 6dEB analog lacking the hydroxyl
group introduced during synthesis step 5 (Donadio et al.,
1991; Fig. 1). Whilst Witkowski et al. (1991a) have
speculated that the long DH-ER interdomain segment is
involved in facilitating protein-protein interactions in the
dimeric FAS enzyme, our results indicate that the presence
of at least a portion of this segment is not absolutely
required for 6dEB PKS function, although the level of
activity of the altered enzyme could not be measured directly.
That the deleted segment is important for KR activity is
improbable, since very little homology was detected with
other multifunctional systems in this region (data not
shown).

An additional feature of the eryA-encoded polypeptides
is the presence of extra N-terminal and C-terminal tails
extending significantly beyond the domain limits (Fig. 6).
The N termini of polypeptides EryAII and EryAIII contain
segments of 26 and 33 aa preceding the KS domains of
SU3 and SU5, respectively, and the C termini of EryAI
and EryAII contain stretches of 69 and 63 aa following the
ACP domains of SU2 and SU4, respectively. The other
multifunctional systems examined do not contain extra tails
of such lengths. In 6msas a 28-aa segment precedes the KS
domain, but the ACP domain is followed by a segment of
only 6 aa at the C terminus of the polypeptide. The KS
domain of rat FAS starts at the N terminus of the protein.
It is tempting to speculate that these additional segments
in the eryA-encoded polypeptides may play a role in
facilitating the correct intermolecular transfer of the growing
acyl chain, such as from SU2 in EryAI to SU3 in EryAII,
either by enabling specific protein-protein interactions, or
by properly positioning the polypeptides on some cellular
structure.

(e) Evolution of the modules

S. erythraea contains a Type-I PKS and, most likely, a 
Type-II FAS system (Revill and Leadlay, 1991). The ev- 
olutionary origin of these two systems can be understood 
by comparison of similar enzymatic functions belonging to 
a Type-I or Type-II system from different sources, as ex- 
emplified for the 21 ACPs presented in the dendrogram in 
Fig. 7. The eryA ACPs are closely related to each other, 
except for ACP-S, which, as described above, does not 
function as all other known ACPs and is less related to the 
other SU ACPs than the ACP from 6msas. Nonetheless, 
the Type-I PKS ACPs appear to be clustered together in- 
dicating greater overall similarity amongst each other than 
with ACPs from other systems. Similarly, the Type-II PKS 
ACPs form their own cluster as do both the Type-I and the 
Type-II FAS ACPs (Fig. 7). The determination that the 
SU ACPs more closely resemble Type-I FAS systems than 
the monofunctional FAS ACP from the same host suggests 
that Type-I PKSs and Type-I FASs had a common an- 
cestor. This hypothesis is corroborated by the observation 
of a similar pattern when the six eryA KSs were compared 
Fig. 7. Relatedness of ACPs and ACP domains. The ACP domains from
multifunctional FAS and PKS systems (see Fig. 2) and the monofunctional
ACPs are shown as the dendrogram obtained from PILEUP. ACP
sequences: WhiE, spore-associated pigment genes from Streptomyces
coelicolor (Davis and Chater, 1990); Barley I and III, forms I and III
(Hansen, 1987); Rape, Brassica napus plastid seed (Safford et al., 1988);
R. meliloti, Rhizobium meliloti constitutive ACP (Platt et al., 1990); E. coli,
E. coli FAS ACP (Holak et al., 1988); Synechocystis, Synechocystis 6803
(Froehlich et al., 1990); S. erythraea, S. erythraea putative FAS ACP
(Revill and Leadlay, 1991); NodF, R. meliloti nodulation-specific ACP
(Debelle and Sharma, 1986).
with eight other Type-I and Type-II sequences (data not
shown).
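
The notion of "relatedness" behind such a dendrogram can be illustrated with pairwise percent identity over an existing alignment. This is only a sketch: the four sequences and their names are hypothetical toy data invented for the example, and PILEUP builds its dendrogram by its own scoring and clustering procedure, which is not reproduced here.

```python
from itertools import combinations

def percent_identity(a, b):
    """Percent identical residues over columns where neither sequence has a gap."""
    if len(a) != len(b):
        raise ValueError("sequences must come from the same alignment")
    pairs = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not pairs:
        return 0.0
    same = sum(1 for x, y in pairs if x == y)
    return 100.0 * same / len(pairs)

# Hypothetical pre-aligned toy sequences, invented for illustration only.
aligned = {
    "typeI_a":  "LGLDSLTAVELRN",
    "typeI_b":  "LGVDSLTAVELRN",
    "typeII_a": "LGADSLDTVELVM",
    "typeII_b": "LGSDSLDTVEMVM",
}

# Score every unordered pair, then list pairs from most to least similar.
scores = {
    (p, q): percent_identity(aligned[p], aligned[q])
    for p, q in combinations(aligned, 2)
}
closest = max(scores, key=scores.get)
for pair, score in sorted(scores.items(), key=lambda kv: -kv[1]):
    print(pair, round(score, 1))
```

In this toy set the within-group pairs score higher than any cross-group pair, which is the pattern the text describes for the Type-I and Type-II clusters.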

The finding of a stretch of the DH-ER interdomain
region in the SU lacking these two functions is also
consistent with the hypothesis that the eryA modules are likely to
have evolved from an ancestral element (FAS- or PKS-like)
which encoded the full set of activities involved in the
processing of the β-carbonyl (DH, ER and KR), followed
by loss of the functions not required at particular steps of
6dEB synthesis. Two modes of specialization through loss
of function seem to have occurred in the eryA modules:
selected mutations in the KR-encoding domain in module
3, and loss of the DH- and ER-encoding segments in all of
the modules except module 4. Loss of function (ER)
through extensive deletion may have also taken place in
6msas. It will be interesting to analyze the sequences of
other PKS systems lacking KR, DH or ER domains to
better understand the mode of evolution of pathways for
complex polyketides.

(f) Conclusions 

Our results on the extent of the various domains in the 
six eryA SU, determined solely by computer-assisted align- 
ments, can be extended to other related systems and are 
substantially in agreement with those independently found 
by P.F. Leadlay and colleagues (personal communication) 
and by Witkowski et al. (1991a), who corroborated their 
computer analysis with limited proteolysis studies. The ex- 
istence of multiple sequences with identical function in eryA 
has greatly facilitated assignments of the various domains. 
We have proposed a location for the DH domain and a 
putative active-site His for it. Type-I FAS and PKS sys- 
tems also seem to share a common origin independent of 
their prokaryotic or eukaryotic source. 
Cter Edge of ACP (last domain of module)

& Nter edge of KS (1st domain of next module)

-all such examples from the rapamycin PKS & the erythromycin PKS 

| ACP 

10 20 30 40 50 

ACP13/KS14-rap  LTGRTSVEQHRIMLELVLE-RRSVLGHSSADAIAT
ACP12/KS13-rap  LAALAPAEREDALLKLVRDSAALVLGHADASTIPA
ACP11/KS12-rap  LAALAPAEREKALLKLVSDGAATVLGHADTSTIPA
ACP09/KS10-rap  LAALAPAEREKALLKLVSDGAATVLGHADTSTIPA
ACP08/KS9-rap   LAALAPEERAKALVKWCDSAATVLGHADVDSIPV
ACP07/KS8-rap   LARLAPVEREKALLKLVCDGAATVLGHADASTIPA
ACP06/KS7-rap   LAALAPAEREKALLKWCDSAAWLGHADARTIPV
ACP05/KS6-rap   LAALAPEERAKALLKVVRDTAATVLGHADARTIPV
ACP03/KS4-rap   LAALAPAEREKALLKLVCDSAAMVLGHADARSIPA
ACP02/KS3-rap   LAALAPAEREKALLKLVCDSAATVLGHADTSTVSV
ACP01/KS2-rap   LARLAPVEREKALLKLVCDGAATVLGHADASTIPA
ACP00/KS1-rap   LAAEAAREQALRDLVRSSVTDILGLSAADRYAP
ACP5-KS6_ery    LAALSTAERREHLAHLIRAEVAAVLGHGDDAAIDR
ACP3-KS4_ery    LAGLSPDEQQENLLELVANAVAEVLGHESAAEINV
ACP1-KS2_ery    LASLPAPEREEALFELVRSHAAAVLGHASAERVPA
ACP0-KS1_ery    GREADAEATF



60 70 80 90 100

ACP13/KS14-rap  DTSFKDLGMDSLTAIELRNRLVAETGLQLPATMVFDYPTANALAAHLLGK
ACP12/KS13-rap  AAAFKDLGIDSLTAVELRNSLAKATGLRLPNTTVFDYPTPAILATRLG--
ACP11/KS12-rap  TTAFKDLGINSLTAVELRNSLAKATELRLPATLVFDYPTPAALAARLD--
ACP09/KS10-rap  TTAFKDLGIDSLTAVELRNSLAKATELRLPATLVFDYPTPTALAARLD--
ACP08/KS9-rap   TAAFRDLGVDSLTAVELRNSLTKATGLRLPATLVFDYPTPGALAARLE
ACP07/KS8-rap   TAAFKDLGIDSLTAVELRNSLTKATGLRLPATLVFDYPTPTALAARLG--
ACP06/KS7-rap   TGAFKDLGVDSLTAVELRNSLVKATGLRLPATMVFDYPTPTALAARLD--
ACP05/KS6-rap   TGAFRDLGIDSLTAVELRNGLAKVTGLRLPATLVFDYPTPAVLAARLG
ACP03/KS4-rap   AGAFKDLGVDSI^VELRNGLVKATGLRLPATLVFDYPTPTVLAARLD--
ACP02/KS3-rap   AAVFRDLGVDSLTAVELRNSLAKATGLRLPATLVFDYPTPTALAVRLG--
ACP01/KS2-rap   TGAFRDLGVDSLTAVELRNGLAKATGLRLPATLVFDYPTPAALAARLE--
ACP00/KS1-rap   DKTSREMGIDSLTSVELRNSLAKATGLRLPATLVFDYPTPAVLWRLG--
ACP5-KS6_ery    DRAFRDLGFDSMTAVDLRNRLAAVTGVREAATWFDHPTITRLADHYL--
ACP3-KS4_ery    RRAFSELGLDSLNAMALRKRLSASTGLRLPASLVFDHPTVTALAQHLR--
ACP1-KS2_ery    DQAFAELGVDSLSALELRNRLGAATGVRLPTTTVFDHPDVRTLAAHLA--
ACP0-KS1_ery    R ELGLDSVLAAQLRAKVSAAIGREVNIALLYDHPTPRALAEALA--

~ ..**..**- • • •* * * 



110 120 130 140 150

ACP13/KS14-rap  LDIPPVQQRLEAPAPSTVTGPADPVADEPSANEPIAIVAMACRLPGGVSS
ACP12/KS13-rap  ELFTGENPAPVRPSVS WGQD EPLAWGMACRLPGGVSS
ACP11/KS12-rap  ELFTGENPVPVRGPVSAVAQD EPLAIVGWACRLPGGVSS
ACP09/KS10-rap  ELFTGENPAPVRGPVSAVAQD EPLAIVGMACRLPGGVSS
ACP08/KS9-rap   ELFTGENPVQVRTPVSAVGQD EPLAIVGMACRLPGGVSS
ACP07/KS8-rap   EWFVGETPVPVRTSVSWAQD EPLAIVGMACRLPGGVSS
ACP06/KS7-rap   ELFTGENPAPVREPVPAVAQD EPLAIVGMACRLPGGVSS
ACP05/KS6-rap   ELFTGENPVLVR-TASWGQD EPLAIVGMACRLPGGVSS
ACP03/KS4-rap   ELFTGENPAPVRGPVSWGQD EPLAIVGMACRLPGGVSS
ACP02/KS3-rap   ELFTGENPVPVRGPVSAVAQD EPLAIVGMACRLPGGVSS
ACP01/KS2-rap   ELFTGENPAPVRTSVSWAQD EPLAIVGMACRLPGGVSS
ACP00/KS1-rap   ELFTGESPAPER-AVSAVGQG EPLAIVGMACRLPGGVSS
ACP5-KS6_ery    ERLVGAAEAEQAPALVREVPK-DADDPIAIVGMACRFPGGVHN
ACP3-KS4_ery    ARLVGDADQAAVRWGAADE SEPIAIVGIGCRFPGGIGS
ACP1-KS2_ery    AELGGATGAEQAAPATTAP VDEPIAIVGMACRLPGEVDS
ACP0-KS1_ery    AGTEVAQRETRARTNEAAP GEPVAWAMACRLPGGVST

— .*.*.*.****. 



KS (N-terminus)

160 170 180 190 200

ACP13/KS14-rap  PEGLWHLVESGTDAISGFPTDRGWDVEGLFDPDPDAAGKSYCVQGGFLDT
ACP12/KS13-rap  PEDLWRLVESGTDAISGFPADRGWDAESLFDPDPDASGKSYCVEGGFLDS
ACP11/KS12-rap  PEDLWRLLESGTDAVSGFPTDRGWDVENLY DMAGKSHRAEGGFLDA
ACP09/KS10-rap  PEDLWRLVESGTDAISGFPTDRGWDVENLYDPDPDAPGKSYSVQGGFLDA
ACP08/KS9-rap   PEDLWRLVESGTDAISGFPTDRGWDVENLFDSDPDAAGKSYCVEGGFLAT
ACP07/KS8-rap   PEDLWRLLESGTDAVSGFPTDRGWDVENLFG---PAAGDSYRLQGGFLDA
ACP06/KS7-rap   PEDLWRLVESGTDAVSGFPTDRGWDVEGLFDPDPDAAGKSYRAEGGFLDT
ACP05/KS6-rap   PEDLWRLVESGTDAISGFPADRGWDAESLFDPDPDAVGKSYCVEGGFLDS
ACP03/KS4-rap   PEDLWRLVESGTDAVSGFPTDRGWDVENLYDSDPEAAGKSYCVQGGFLDT
ACP02/KS3-rap   PEDLWRLLESGTDAVSGFPTDRGWDVENLYD MAGKSHRAEGGFLDA
ACP01/KS2-rap   PEDLWRLLESGTDAVSGFPTDRGWDVENLFG---PAVGNSYRLQGGFLDA
ACP00/KS1-rap   PEDLWRLVESGTDAISGFPTDRGWDVDGLFDPDPDASGKSYCVQGGFLDT
ACP5-KS6_ery    PGELWEFIVGRGDAVTEMPTDRGWDLDALFDPDPQRHGTSYSRHGAFLDG
ACP3-KS4_ery    PEQLWRVLAEGANLTTGFPADRGWDIGRLYHPDPDNPGTSYVDKGGFLTD
ACP1-KS2_ery    PERLWELITSGRDSAAEVPDDRGWVPDELMASD AAGTRAHGNFMAG
ACP0-KS1_ery    PEEFWELLSEGRDAVAGLPTDRGWDLDSLFHPDPTRSGTAHQRGGGFLTE

* * * **** * * * ■ 



210 220 230 240 250 

ACP13/KS14-rap  AADFDAPFFGISPREALGMDPQQRLLLETTWEAIERAQIDPKSLRGRDVG
ACP12/KS13-rap  AGSFDAGFFGISPREALAMDPQQRLIMEVSWEAFERAGIEPGSVRG-THR
ACP11/KS12-rap  AAGFDAGFFGISPREALAMDPQQRLVLEVSWEAFERAGIEPGSVRGSDTG
ACP09/KS10-rap  AAGFDASFFGISPREALAMDPQQRLMLEVSWEAFERAGIEPGSVRGSDTG
ACP08/KS9-rap   AANFDASFFGISPREALAMDPQQRLVLEVSWEAFERAGIEPGSVRGSDTG
ACP07/KS8-rap   SGFDASFFGISPREALAMDPQQRLVLEVSWEAFERAGIEPGSVRGTDTG
ACP06/KS7-rap   AAGFDAGFFGISPREALAMDPQQRLLLEVSWEAFERAGIEPGSVRGSDTG
ACP05/KS6-rap   AASFDAGFFGISPREALAMDPQQRLIMEVSWEAFERAGIEPGSVRGSDTG
ACP03/KS4-rap   AAGFDAGFFGISPREALAMDPQQRLLLEVSWEAFERAGIEPGSVRGSDTG
ACP02/KS3-rap   AAGFDAGFFGISPREALAMDPQQRLVLEVSWEAFERAGIEPGSVRGSDTG
ACP01/KS2-rap   AAGFDASFFGISPREALAMDPQQRLVLEVSWEAFERAGIKPGSVRGTDTG
ACP00/KS1-rap   AAGFDASFFGISPREALAMDPQQRLVLEVSWEAFERAGIEPGSVRGSDTG
ACP5-KS6_ery    AADFDAAFFGISPREALA^PQQRQVLETTWELFENAGIDPHSLRGSDTG
ACP3-KS4_ery    AADFDPGFFGITPREALAMDPQQRLMLETAWEAVERAGIDPDALRGTDTG
ACP1-KS2_ery    AGDFDAAFFGISPREALAMDPQQRQALETTWEALESAGIPPETLRGSDTG
ACP0-KS1_ery    ATAFDPAFFGMSPREALAVDPQQRLMLELSWEVLERAGIPPTSLQASPTG

- * ** ♦»*..***** .*♦*** .* .** ****... 

260 270 280 290 300

ACP13/KS14-rap  VYVGGAAQGYGVGVDQQ----HDNGITGSSVSLLSGRVSYALGLEGPGVT
ACP12/KS13-rap  RLHGRVRGGYGAGADL-----GGFAATASATSVLSGRVSYFFGLEGPAIT
ACP11/KS12-rap  VFMGAYPGGYGIGADL-----GGFGATASSVSVLSGRVSYFFGLEGPAFT
ACP09/KS10-rap  VFIGAYPGGYGIGADL-----GGFGTTAGAASVLSGRVSYFFGLEGPAFT
ACP08/KS9-rap   VFMGAFPGGYGIGADL-----EGYGATA-GLNVLSGRLSYFFGLEGPAVT
ACP07/KS8-rap   VFMGAYPGGYGIGADL-----GGFGATASAVSVLSGRVSYFFGLEGPAIT
ACP06/KS7-rap   VFIGAFPVGYGAGAAR-----EGYGATA-APNVLSGRLSYFFGLEGPAIT
ACP05/KS6-rap   VFMGAYAGGYGAGADL-----GGFAATASATSVLSGRVSYFFGLEGPAIT
ACP03/KS4-rap   VFIGAFPVGYGAGFDR-----EGYGATS-GPSYLSGRVSYVFGLEGPAIT
ACP02/KS3-rap   VFMGAYPGGYGAGADL-----GGFAATASATSVLSGRVSYFFGLEGPAFT
ACP01/KS2-rap   VFMGAYPGGYGIGADL-----GGFGTTAGAVSVLSGRVSYFFGFEGPAFT
ACP00/KS1-rap   VFMGGFPGGYGAGADL-----EGFGATAGAASVLSGRVSYFFGLEGPAIT
ACP5-KS6_ery    VFLGAAYQGYGQDAWPED-SEGYLLTGNSSAWSGRVAYVLGLEGPAVT
ACP3-KS4_ery    VFVGMNGQSYMQLLAGEAERVDGYQGLGNSASVLSGRIAYTFGWEGPALT
ACP1-KS2_ery    VFVGMSHQGYATGRPRPEDGVDGYLLTGNTASVASGRIAYVLGLEGPALT
ACP0-KS1_ery    VFVGLIPQEYGPRLAEGGEGVEGYLMTGTTTSVASGRIAYTLGLEGPAIS



active site Cys

| 310 320 330 340 350

ACP13/KS14-rap  VDTACSSSLVALHLASQALRQRECSLALVSGVSVMSSPAMFVEFSRQRGL
ACP12/KS13-rap  VDTACSSSLVALHQAGYALRQGECSLALVGGVTVMATPQSFVEFSRQRGL
ACP11/KS12-rap  VDTACSSSLVALHQAGYALRQGECSLALVGGVTVMATPQTFVEFSRQGGL
ACP09/KS10-rap  VDTACSSSLVALHQAGYALRQGECSLALVGGVTVMPTPQTFVEFSRQRGL
ACP08/KS9-rap   VDTACSSSLVALHQAGYALRQGECSLALIGGVTVMATPHTFVEFSRQRGL
ACP07/KS8-rap   VDTACSSSLVALHQAGYALRQGECSLALVGGVTVMATPQTFVEFARQGGL
ACP06/KS7-rap   MDTACSSSLVALHLAAQALRNGECSMALAGGVTVMATPEVFTEFARQRGL
ACP05/KS6-rap   VDTACSSSLVALYQAGYALRQGECSLALVGGVTVMATPQSFVEFSRKSGL
ACP03/KS4-rap   MDTACSSSLVALHI^AQALRNGECSMALAGGVTVMATPEVFTEFARQRGL
ACP02/KS3-rap   VDTACSSSLVALHQAGYALRQGECSLALVGGVTVMATPELFTEFSRQRGL
ACP01/KS2-rap   VDTACSSSLVALHQAGYALRQGECSLALVGGVTVMATPQTFVEFARQGGL
ACP00/KS1-rap   VDTACSSSLVALHQAGYALRQGECSLALVGGVTVMPTPQSFVEFSRQRGL
ACP5-KS6_ery    VDTACSSSLVALHSACGSLRDGDCGLAVAGGVSVMAGPEVFTEFSRQGGL
ACP3-KS4_ery    VDTACSSSLVGIHLAMQALRRGECSLALAGGVTVMSDPYTFVDFSTQRGL
ACP1-KS2_ery    VDTACSSSLVALHTACGSLRDGDCGLAVAGGVSVMAGPEVFTEFSRQGAL
ACP0-KS1_ery    VDTACSSSLVAVHLACQSLRRGESSLAMAGGVTVMPTPGMLVDFSRMNSL



********* 
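
The row of asterisks under each block marks columns that are identical in every sequence. A minimal sketch of how such a line can be derived is given below, assuming equal-length rows from a finished alignment; the function name is illustrative, and the four rows are copied from the 310-350 block above with OCR spacing removed.

```python
def conservation_line(rows):
    """Return '*' under columns where every row has the same residue, ' ' elsewhere."""
    width = len(rows[0])
    if any(len(r) != width for r in rows):
        raise ValueError("all rows must have the same length")
    marks = []
    for column in zip(*rows):
        # A column counts as invariant only if it holds one residue and no gap.
        invariant = len(set(column)) == 1 and column[0] != "-"
        marks.append("*" if invariant else " ")
    return "".join(marks)

# Four rows taken from the 310-350 block (OCR spacing removed).
rows = [
    "VDTACSSSLVALHLASQALRQRECSLALVSGVSVMSSPAMFVEFSRQRGL",
    "MDTACSSSLVALHLAAQALRNGECSMALAGGVTVMATPEVFTEFARQRGL",
    "VDTACSSSLVGIHLAMQALRRGECSLALAGGVTVMSDPYTFVDFSTQRGL",
    "VDTACSSSLVAVHLACQSLRRGESSLAMAGGVTVMPTPGMLVDFSRMNSL",
]
line = conservation_line(rows)
print(rows[0])
print(line)
```

With these rows the nine columns spelling DTACSSSLV come out invariant, which agrees with the nine-asterisk run printed under the block; that run contains the Cys that the block header presumably flags as the active site. Because only four of the sixteen rows are used here, some further columns appear invariant that would not be in the full alignment.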



